propy.QuasiSequenceOrder

Compute the quasi sequence order descriptors based on the given protein sequence. We can obtain two types of descriptors: Sequence-order-coupling number and quasi-sequence-order descriptors. Two distance matrixes between 20 amino acids are employed.

References

[1](1, 2) Kuo-Chen Chou. Prediction of Protein Subcellar Locations by Incorporating Quasi-Sequence-Order Effect. Biochemical and Biophysical Research Communications 2000, 278, 477-483.
[2]Kuo-Chen Chou and Yu-Dong Cai. Prediction of Protein sucellular locations by GO-FunD-PseAA predictor, Biochemical and Biophysical Research Communications, 2004, 320, 1236-1239.
[3]Gisbert Schneider and Paul wrede. The Rational Design of Amino Acid Sequences by Artifical Neural Networks and Simulated Molecular Evolution: Do Novo Design of an Idealized Leader Cleavge Site. Biophys Journal, 1994, 66, 335-344.
propy.QuasiSequenceOrder.GetAAComposition(ProteinSequence: str) → Dict[str, float][source]

Calculate the composition of Amino acids for a given protein sequence.

Parameters:ProteinSequence (str) – a pure protein sequence
Returns:result – contains the composition of 20 amino acids.
Return type:Dict[str, float]

Examples

>>> from propy.GetProteinFromUniprot import GetProteinSequence
>>> from propy.AAComposition import CalculateAAComposition
>>> protein = GetProteinSequence(ProteinID="Q9NQ39")
>>> result = CalculateAAComposition(protein)
propy.QuasiSequenceOrder.GetQuasiSequenceOrder(ProteinSequence: str, maxlag: int = 30, weight: float = 0.1) → Dict[Any, Any][source]

Compute quasi-sequence-order descriptors for a given protein.

See [1] for details.

Parameters:
  • ProteinSequence (str) – a pure protein sequence
  • maxlag (int, optional (default: 30)) – the maximum lag and the length of the protein should be larger than maxlag
  • weight (float, optional (default: 0.1)) – a weight factor. Please see reference 1 for its choice.
Returns:

result – contains all quasi-sequence-order descriptors

Return type:

Dict

Examples

>>> from propy.GetProteinFromUniprot import GetProteinSequence
>>> protein = GetProteinSequence(ProteinID="Q9NQ39")
>>> result = GetQuasiSequenceOrder(protein)
propy.QuasiSequenceOrder.GetQuasiSequenceOrder1(ProteinSequence: str, maxlag: int = 30, weight: float = 0.1, distancematrix=None)[source]

Compute the first 20 quasi-sequence-order descriptors for a given protein sequence.

Parameters:
  • ProteinSequence (str) – a pure protein sequence
  • maxlag (int, optional (default: 30)) – the maximum lag and the length of the protein should be larger

Examples

>>> from propy.GetProteinFromUniprot import GetProteinSequence
>>> protein = GetProteinSequence(ProteinID="Q9NQ39")
>>> result = GetQuasiSequenceOrder1(protein)

see GetQuasiSequenceOrder() for the choice of parameters.

propy.QuasiSequenceOrder.GetQuasiSequenceOrder1Grant(ProteinSequence: str, maxlag: int = 30, weight: float = 0.1, distancematrix={'AA': 0, 'AC': 195, 'AD': 126, 'AE': 107, 'AF': 113, 'AG': 60, 'AH': 86, 'AI': 94, 'AK': 106, 'AL': 96, 'AM': 84, 'AN': 111, 'AP': 27, 'AQ': 91, 'AR': 112, 'AS': 99, 'AT': 58, 'AV': 64, 'AW': 148, 'AY': 112, 'CA': 195, 'CC': 0, 'CD': 154, 'CE': 170, 'CF': 205, 'CG': 159, 'CH': 174, 'CI': 198, 'CK': 202, 'CL': 198, 'CM': 196, 'CN': 139, 'CP': 169, 'CQ': 154, 'CR': 180, 'CS': 112, 'CT': 149, 'CV': 192, 'CW': 215, 'CY': 194, 'DA': 126, 'DC': 154, 'DD': 0, 'DE': 45, 'DF': 177, 'DG': 94, 'DH': 81, 'DI': 168, 'DK': 101, 'DL': 172, 'DM': 160, 'DN': 23, 'DP': 108, 'DQ': 61, 'DR': 96, 'DS': 65, 'DT': 85, 'DV': 152, 'DW': 181, 'DY': 160, 'EA': 107, 'EC': 170, 'ED': 45, 'EE': 0, 'EF': 140, 'EG': 98, 'EH': 40, 'EI': 134, 'EK': 56, 'EL': 138, 'EM': 126, 'EN': 42, 'EP': 93, 'EQ': 29, 'ER': 54, 'ES': 80, 'ET': 65, 'EV': 121, 'EW': 152, 'EY': 122, 'FA': 113, 'FC': 205, 'FD': 177, 'FE': 140, 'FF': 0, 'FG': 153, 'FH': 100, 'FI': 21, 'FK': 102, 'FL': 22, 'FM': 28, 'FN': 158, 'FP': 114, 'FQ': 116, 'FR': 97, 'FS': 155, 'FT': 103, 'FV': 50, 'FW': 40, 'FY': 22, 'GA': 60, 'GC': 159, 'GD': 94, 'GE': 98, 'GF': 153, 'GG': 0, 'GH': 98, 'GI': 135, 'GK': 127, 'GL': 138, 'GM': 127, 'GN': 80, 'GP': 42, 'GQ': 87, 'GR': 125, 'GS': 56, 'GT': 59, 'GV': 109, 'GW': 184, 'GY': 147, 'HA': 86, 'HC': 174, 'HD': 81, 'HE': 40, 'HF': 100, 'HG': 98, 'HH': 0, 'HI': 94, 'HK': 32, 'HL': 99, 'HM': 87, 'HN': 68, 'HP': 77, 'HQ': 24, 'HR': 29, 'HS': 89, 'HT': 47, 'HV': 84, 'HW': 115, 'HY': 83, 'IA': 94, 'IC': 198, 'ID': 168, 'IE': 134, 'IF': 21, 'IG': 135, 'IH': 94, 'II': 0, 'IK': 102, 'IL': 5, 'IM': 10, 'IN': 149, 'IP': 95, 'IQ': 109, 'IR': 97, 'IS': 142, 'IT': 89, 'IV': 29, 'IW': 61, 'IY': 33, 'KA': 106, 'KC': 202, 'KD': 101, 'KE': 56, 'KF': 102, 'KG': 127, 'KH': 32, 'KI': 102, 'KK': 0, 'KL': 107, 'KM': 95, 'KN': 94, 'KP': 103, 'KQ': 53, 'KR': 26, 'KS': 121, 'KT': 78, 'KV': 97, 'KW': 110, 'KY': 85, 'LA': 96, 'LC': 198, 'LD': 172, 'LE': 138, 'LF': 22, 'LG': 138, 'LH': 99, 'LI': 5, 'LK': 107, 'LL': 0, 'LM': 15, 'LN': 153, 'LP': 98, 'LQ': 113, 'LR': 102, 'LS': 145, 'LT': 92, 'LV': 32, 'LW': 61, 'LY': 36, 'MA': 84, 'MC': 196, 'MD': 160, 'ME': 126, 'MF': 28, 'MG': 127, 'MH': 87, 'MI': 10, 'MK': 95, 'ML': 15, 'MM': 0, 'MN': 142, 'MP': 87, 'MQ': 101, 'MR': 91, 'MS': 135, 'MT': 81, 'MV': 21, 'MW': 67, 'MY': 36, 'NA': 111, 'NC': 139, 'ND': 23, 'NE': 42, 'NF': 158, 'NG': 80, 'NH': 68, 'NI': 149, 'NK': 94, 'NL': 153, 'NM': 142, 'NN': 0, 'NP': 91, 'NQ': 46, 'NR': 86, 'NS': 46, 'NT': 65, 'NV': 133, 'NW': 174, 'NY': 143, 'PA': 27, 'PC': 169, 'PD': 108, 'PE': 93, 'PF': 114, 'PG': 42, 'PH': 77, 'PI': 95, 'PK': 103, 'PL': 98, 'PM': 87, 'PN': 91, 'PP': 0, 'PQ': 76, 'PR': 103, 'PS': 74, 'PT': 38, 'PV': 68, 'PW': 147, 'PY': 110, 'QA': 91, 'QC': 154, 'QD': 61, 'QE': 29, 'QF': 116, 'QG': 87, 'QH': 24, 'QI': 109, 'QK': 53, 'QL': 113, 'QM': 101, 'QN': 46, 'QP': 76, 'QQ': 0, 'QR': 43, 'QS': 68, 'QT': 42, 'QV': 96, 'QW': 130, 'QY': 99, 'RA': 112, 'RC': 180, 'RD': 96, 'RE': 54, 'RF': 97, 'RG': 125, 'RH': 29, 'RI': 97, 'RK': 26, 'RL': 102, 'RM': 91, 'RN': 86, 'RP': 103, 'RQ': 43, 'RR': 0, 'RS': 110, 'RT': 71, 'RV': 96, 'RW': 101, 'RY': 77, 'SA': 99, 'SC': 112, 'SD': 65, 'SE': 80, 'SF': 155, 'SG': 56, 'SH': 89, 'SI': 142, 'SK': 121, 'SL': 145, 'SM': 135, 'SN': 46, 'SP': 74, 'SQ': 68, 'SR': 110, 'SS': 0, 'ST': 58, 'SV': 124, 'SW': 177, 'SY': 144, 'TA': 58, 'TC': 149, 'TD': 85, 'TE': 65, 'TF': 103, 'TG': 59, 'TH': 47, 'TI': 89, 'TK': 78, 'TL': 92, 'TM': 81, 'TN': 65, 'TP': 38, 'TQ': 42, 'TR': 71, 'TS': 58, 'TT': 0, 'TV': 69, 'TW': 128, 'TY': 92, 'VA': 64, 'VC': 192, 'VD': 152, 'VE': 121, 'VF': 50, 'VG': 109, 'VH': 84, 'VI': 29, 'VK': 97, 'VL': 32, 'VM': 21, 'VN': 133, 'VP': 68, 'VQ': 96, 'VR': 96, 'VS': 124, 'VT': 69, 'VV': 0, 'VW': 88, 'VY': 55, 'WA': 148, 'WC': 215, 'WD': 181, 'WE': 152, 'WF': 40, 'WG': 184, 'WH': 115, 'WI': 61, 'WK': 110, 'WL': 61, 'WM': 67, 'WN': 174, 'WP': 147, 'WQ': 130, 'WR': 101, 'WS': 177, 'WT': 128, 'WV': 88, 'WW': 0, 'WY': 37, 'YA': 112, 'YC': 194, 'YD': 160, 'YE': 122, 'YF': 22, 'YG': 147, 'YH': 83, 'YI': 33, 'YK': 85, 'YL': 36, 'YM': 36, 'YN': 143, 'YP': 110, 'YQ': 99, 'YR': 77, 'YS': 144, 'YT': 92, 'YV': 55, 'YW': 37, 'YY': 0})[source]

Compute the first 20 quasi-sequence-order descriptors for a given protein sequence.

Parameters:
  • ProteinSequence (str) – a pure protein sequence
  • maxlag (int, optional (default: 30)) – the maximum lag and the length of the protein should be larger

Examples

>>> from propy.GetProteinFromUniprot import GetProteinSequence
>>> protein = GetProteinSequence(ProteinID="Q9NQ39")
>>> result = GetQuasiSequenceOrder1Grant(protein)

see GetQuasiSequenceOrder() for the choice of parameters.

propy.QuasiSequenceOrder.GetQuasiSequenceOrder1SW(ProteinSequence: str, maxlag=30, weight=0.1, distancematrix={'AA': 0.0, 'AC': 0.112, 'AD': 0.819, 'AE': 0.827, 'AF': 0.54, 'AG': 0.208, 'AH': 0.696, 'AI': 0.407, 'AK': 0.891, 'AL': 0.406, 'AM': 0.379, 'AN': 0.318, 'AP': 0.191, 'AQ': 0.372, 'AR': 1.0, 'AS': 0.094, 'AT': 0.22, 'AV': 0.273, 'AW': 0.739, 'AY': 0.552, 'CA': 0.114, 'CC': 0.0, 'CD': 0.847, 'CE': 0.838, 'CF': 0.437, 'CG': 0.32, 'CH': 0.66, 'CI': 0.304, 'CK': 0.887, 'CL': 0.301, 'CM': 0.277, 'CN': 0.324, 'CP': 0.157, 'CQ': 0.341, 'CR': 1.0, 'CS': 0.176, 'CT': 0.233, 'CV': 0.167, 'CW': 0.639, 'CY': 0.457, 'DA': 0.729, 'DC': 0.742, 'DD': 0.0, 'DE': 0.124, 'DF': 0.924, 'DG': 0.697, 'DH': 0.435, 'DI': 0.847, 'DK': 0.249, 'DL': 0.841, 'DM': 0.819, 'DN': 0.56, 'DP': 0.657, 'DQ': 0.584, 'DR': 0.295, 'DS': 0.667, 'DT': 0.649, 'DV': 0.797, 'DW': 1.0, 'DY': 0.836, 'EA': 0.79, 'EC': 0.788, 'ED': 0.133, 'EE': 0.0, 'EF': 0.932, 'EG': 0.779, 'EH': 0.406, 'EI': 0.86, 'EK': 0.143, 'EL': 0.854, 'EM': 0.83, 'EN': 0.599, 'EP': 0.688, 'EQ': 0.598, 'ER': 0.234, 'ES': 0.726, 'ET': 0.682, 'EV': 0.824, 'EW': 1.0, 'EY': 0.837, 'FA': 0.508, 'FC': 0.405, 'FD': 0.977, 'FE': 0.918, 'FF': 0.0, 'FG': 0.69, 'FH': 0.663, 'FI': 0.128, 'FK': 0.903, 'FL': 0.131, 'FM': 0.169, 'FN': 0.541, 'FP': 0.42, 'FQ': 0.459, 'FR': 1.0, 'FS': 0.548, 'FT': 0.499, 'FV': 0.252, 'FW': 0.207, 'FY': 0.179, 'GA': 0.206, 'GC': 0.312, 'GD': 0.776, 'GE': 0.807, 'GF': 0.727, 'GG': 0.0, 'GH': 0.769, 'GI': 0.592, 'GK': 0.894, 'GL': 0.591, 'GM': 0.557, 'GN': 0.381, 'GP': 0.323, 'GQ': 0.467, 'GR': 1.0, 'GS': 0.158, 'GT': 0.272, 'GV': 0.464, 'GW': 0.923, 'GY': 0.728, 'HA': 0.896, 'HC': 0.836, 'HD': 0.629, 'HE': 0.547, 'HF': 0.907, 'HG': 1.0, 'HH': 0.0, 'HI': 0.848, 'HK': 0.566, 'HL': 0.842, 'HM': 0.825, 'HN': 0.754, 'HP': 0.777, 'HQ': 0.716, 'HR': 0.697, 'HS': 0.865, 'HT': 0.834, 'HV': 0.831, 'HW': 0.981, 'HY': 0.821, 'IA': 0.403, 'IC': 0.296, 'ID': 0.942, 'IE': 0.891, 'IF': 0.134, 'IG': 0.592, 'IH': 0.652, 'II': 0.0, 'IK': 0.892, 'IL': 0.013, 'IM': 0.057, 'IN': 0.457, 'IP': 0.311, 'IQ': 0.383, 'IR': 1.0, 'IS': 0.443, 'IT': 0.396, 'IV': 0.133, 'IW': 0.339, 'IY': 0.213, 'KA': 0.889, 'KC': 0.871, 'KD': 0.279, 'KE': 0.149, 'KF': 0.957, 'KG': 0.9, 'KH': 0.438, 'KI': 0.899, 'KK': 0.0, 'KL': 0.892, 'KM': 0.871, 'KN': 0.667, 'KP': 0.757, 'KQ': 0.639, 'KR': 0.154, 'KS': 0.825, 'KT': 0.759, 'KV': 0.882, 'KW': 1.0, 'KY': 0.848, 'LA': 0.405, 'LC': 0.296, 'LD': 0.944, 'LE': 0.892, 'LF': 0.139, 'LG': 0.596, 'LH': 0.653, 'LI': 0.013, 'LK': 0.893, 'LL': 0.0, 'LM': 0.062, 'LN': 0.452, 'LP': 0.309, 'LQ': 0.376, 'LR': 1.0, 'LS': 0.443, 'LT': 0.397, 'LV': 0.133, 'LW': 0.341, 'LY': 0.205, 'MA': 0.383, 'MC': 0.276, 'MD': 0.932, 'ME': 0.879, 'MF': 0.182, 'MG': 0.569, 'MH': 0.648, 'MI': 0.058, 'MK': 0.884, 'ML': 0.062, 'MM': 0.0, 'MN': 0.447, 'MP': 0.285, 'MQ': 0.372, 'MR': 1.0, 'MS': 0.417, 'MT': 0.358, 'MV': 0.12, 'MW': 0.391, 'MY': 0.255, 'NA': 0.424, 'NC': 0.425, 'ND': 0.838, 'NE': 0.835, 'NF': 0.766, 'NG': 0.512, 'NH': 0.78, 'NI': 0.615, 'NK': 0.891, 'NL': 0.603, 'NM': 0.588, 'NN': 0.0, 'NP': 0.266, 'NQ': 0.175, 'NR': 1.0, 'NS': 0.361, 'NT': 0.368, 'NV': 0.503, 'NW': 0.945, 'NY': 0.641, 'PA': 0.22, 'PC': 0.179, 'PD': 0.852, 'PE': 0.831, 'PF': 0.515, 'PG': 0.376, 'PH': 0.696, 'PI': 0.363, 'PK': 0.875, 'PL': 0.357, 'PM': 0.326, 'PN': 0.231, 'PP': 0.0, 'PQ': 0.228, 'PR': 1.0, 'PS': 0.196, 'PT': 0.161, 'PV': 0.244, 'PW': 0.72, 'PY': 0.481, 'QA': 0.512, 'QC': 0.462, 'QD': 0.903, 'QE': 0.861, 'QF': 0.671, 'QG': 0.648, 'QH': 0.765, 'QI': 0.532, 'QK': 0.881, 'QL': 0.518, 'QM': 0.505, 'QN': 0.181, 'QP': 0.272, 'QQ': 0.0, 'QR': 1.0, 'QS': 0.461, 'QT': 0.389, 'QV': 0.464, 'QW': 0.831, 'QY': 0.522, 'RA': 0.919, 'RC': 0.905, 'RD': 0.305, 'RE': 0.225, 'RF': 0.977, 'RG': 0.928, 'RH': 0.498, 'RI': 0.929, 'RK': 0.141, 'RL': 0.92, 'RM': 0.908, 'RN': 0.69, 'RP': 0.796, 'RQ': 0.668, 'RR': 0.0, 'RS': 0.86, 'RT': 0.808, 'RV': 0.914, 'RW': 1.0, 'RY': 0.859, 'SA': 0.1, 'SC': 0.185, 'SD': 0.801, 'SE': 0.812, 'SF': 0.622, 'SG': 0.17, 'SH': 0.718, 'SI': 0.478, 'SK': 0.883, 'SL': 0.474, 'SM': 0.44, 'SN': 0.289, 'SP': 0.181, 'SQ': 0.358, 'SR': 1.0, 'SS': 0.0, 'ST': 0.174, 'SV': 0.342, 'SW': 0.827, 'SY': 0.615, 'TA': 0.251, 'TC': 0.261, 'TD': 0.83, 'TE': 0.812, 'TF': 0.604, 'TG': 0.312, 'TH': 0.737, 'TI': 0.455, 'TK': 0.866, 'TL': 0.453, 'TM': 0.403, 'TN': 0.315, 'TP': 0.159, 'TQ': 0.322, 'TR': 1.0, 'TS': 0.185, 'TT': 0.0, 'TV': 0.345, 'TW': 0.816, 'TY': 0.596, 'VA': 0.275, 'VC': 0.165, 'VD': 0.9, 'VE': 0.867, 'VF': 0.269, 'VG': 0.471, 'VH': 0.649, 'VI': 0.135, 'VK': 0.889, 'VL': 0.134, 'VM': 0.12, 'VN': 0.38, 'VP': 0.212, 'VQ': 0.339, 'VR': 1.0, 'VS': 0.322, 'VT': 0.305, 'VV': 0.0, 'VW': 0.472, 'VY': 0.31, 'WA': 0.658, 'WC': 0.56, 'WD': 1.0, 'WE': 0.931, 'WF': 0.196, 'WG': 0.829, 'WH': 0.678, 'WI': 0.305, 'WK': 0.892, 'WL': 0.304, 'WM': 0.344, 'WN': 0.631, 'WP': 0.555, 'WQ': 0.538, 'WR': 0.968, 'WS': 0.689, 'WT': 0.638, 'WV': 0.418, 'WW': 0.0, 'WY': 0.204, 'YA': 0.587, 'YC': 0.478, 'YD': 1.0, 'YE': 0.932, 'YF': 0.202, 'YG': 0.782, 'YH': 0.678, 'YI': 0.23, 'YK': 0.904, 'YL': 0.219, 'YM': 0.268, 'YN': 0.512, 'YP': 0.444, 'YQ': 0.404, 'YR': 0.995, 'YS': 0.612, 'YT': 0.557, 'YV': 0.328, 'YW': 0.244, 'YY': 0.0})[source]

Compute the first 20 quasi-sequence-order descriptors for a given protein sequence.

Parameters:
  • ProteinSequence (str) – a pure protein sequence
  • maxlag (int, optional (default: 30)) – the maximum lag and the length of the protein should be larger

Examples

>>> from propy.GetProteinFromUniprot import GetProteinSequence
>>> protein = GetProteinSequence(ProteinID="Q9NQ39")
>>> result = GetQuasiSequenceOrder1SW(protein)

see GetQuasiSequenceOrder() for the choice of parameters.

propy.QuasiSequenceOrder.GetQuasiSequenceOrder2(ProteinSequence: str, maxlag=30, weight=0.1, distancematrix=None)[source]

Compute the last maxlag quasi-sequence-order descriptors for a given protein sequence.

Parameters:
  • ProteinSequence (str) – a pure protein sequence
  • maxlag (int, optional (default: 30)) – the maximum lag and the length of the protein should be larger

Examples

>>> from propy.GetProteinFromUniprot import GetProteinSequence
>>> protein = GetProteinSequence(ProteinID="Q9NQ39")
>>> result = GetQuasiSequenceOrder2(protein)

see GetQuasiSequenceOrder() for the choice of parameters.

propy.QuasiSequenceOrder.GetQuasiSequenceOrder2Grant(ProteinSequence: str, maxlag: int = 30, weight: float = 0.1, distancematrix={'AA': 0, 'AC': 195, 'AD': 126, 'AE': 107, 'AF': 113, 'AG': 60, 'AH': 86, 'AI': 94, 'AK': 106, 'AL': 96, 'AM': 84, 'AN': 111, 'AP': 27, 'AQ': 91, 'AR': 112, 'AS': 99, 'AT': 58, 'AV': 64, 'AW': 148, 'AY': 112, 'CA': 195, 'CC': 0, 'CD': 154, 'CE': 170, 'CF': 205, 'CG': 159, 'CH': 174, 'CI': 198, 'CK': 202, 'CL': 198, 'CM': 196, 'CN': 139, 'CP': 169, 'CQ': 154, 'CR': 180, 'CS': 112, 'CT': 149, 'CV': 192, 'CW': 215, 'CY': 194, 'DA': 126, 'DC': 154, 'DD': 0, 'DE': 45, 'DF': 177, 'DG': 94, 'DH': 81, 'DI': 168, 'DK': 101, 'DL': 172, 'DM': 160, 'DN': 23, 'DP': 108, 'DQ': 61, 'DR': 96, 'DS': 65, 'DT': 85, 'DV': 152, 'DW': 181, 'DY': 160, 'EA': 107, 'EC': 170, 'ED': 45, 'EE': 0, 'EF': 140, 'EG': 98, 'EH': 40, 'EI': 134, 'EK': 56, 'EL': 138, 'EM': 126, 'EN': 42, 'EP': 93, 'EQ': 29, 'ER': 54, 'ES': 80, 'ET': 65, 'EV': 121, 'EW': 152, 'EY': 122, 'FA': 113, 'FC': 205, 'FD': 177, 'FE': 140, 'FF': 0, 'FG': 153, 'FH': 100, 'FI': 21, 'FK': 102, 'FL': 22, 'FM': 28, 'FN': 158, 'FP': 114, 'FQ': 116, 'FR': 97, 'FS': 155, 'FT': 103, 'FV': 50, 'FW': 40, 'FY': 22, 'GA': 60, 'GC': 159, 'GD': 94, 'GE': 98, 'GF': 153, 'GG': 0, 'GH': 98, 'GI': 135, 'GK': 127, 'GL': 138, 'GM': 127, 'GN': 80, 'GP': 42, 'GQ': 87, 'GR': 125, 'GS': 56, 'GT': 59, 'GV': 109, 'GW': 184, 'GY': 147, 'HA': 86, 'HC': 174, 'HD': 81, 'HE': 40, 'HF': 100, 'HG': 98, 'HH': 0, 'HI': 94, 'HK': 32, 'HL': 99, 'HM': 87, 'HN': 68, 'HP': 77, 'HQ': 24, 'HR': 29, 'HS': 89, 'HT': 47, 'HV': 84, 'HW': 115, 'HY': 83, 'IA': 94, 'IC': 198, 'ID': 168, 'IE': 134, 'IF': 21, 'IG': 135, 'IH': 94, 'II': 0, 'IK': 102, 'IL': 5, 'IM': 10, 'IN': 149, 'IP': 95, 'IQ': 109, 'IR': 97, 'IS': 142, 'IT': 89, 'IV': 29, 'IW': 61, 'IY': 33, 'KA': 106, 'KC': 202, 'KD': 101, 'KE': 56, 'KF': 102, 'KG': 127, 'KH': 32, 'KI': 102, 'KK': 0, 'KL': 107, 'KM': 95, 'KN': 94, 'KP': 103, 'KQ': 53, 'KR': 26, 'KS': 121, 'KT': 78, 'KV': 97, 'KW': 110, 'KY': 85, 'LA': 96, 'LC': 198, 'LD': 172, 'LE': 138, 'LF': 22, 'LG': 138, 'LH': 99, 'LI': 5, 'LK': 107, 'LL': 0, 'LM': 15, 'LN': 153, 'LP': 98, 'LQ': 113, 'LR': 102, 'LS': 145, 'LT': 92, 'LV': 32, 'LW': 61, 'LY': 36, 'MA': 84, 'MC': 196, 'MD': 160, 'ME': 126, 'MF': 28, 'MG': 127, 'MH': 87, 'MI': 10, 'MK': 95, 'ML': 15, 'MM': 0, 'MN': 142, 'MP': 87, 'MQ': 101, 'MR': 91, 'MS': 135, 'MT': 81, 'MV': 21, 'MW': 67, 'MY': 36, 'NA': 111, 'NC': 139, 'ND': 23, 'NE': 42, 'NF': 158, 'NG': 80, 'NH': 68, 'NI': 149, 'NK': 94, 'NL': 153, 'NM': 142, 'NN': 0, 'NP': 91, 'NQ': 46, 'NR': 86, 'NS': 46, 'NT': 65, 'NV': 133, 'NW': 174, 'NY': 143, 'PA': 27, 'PC': 169, 'PD': 108, 'PE': 93, 'PF': 114, 'PG': 42, 'PH': 77, 'PI': 95, 'PK': 103, 'PL': 98, 'PM': 87, 'PN': 91, 'PP': 0, 'PQ': 76, 'PR': 103, 'PS': 74, 'PT': 38, 'PV': 68, 'PW': 147, 'PY': 110, 'QA': 91, 'QC': 154, 'QD': 61, 'QE': 29, 'QF': 116, 'QG': 87, 'QH': 24, 'QI': 109, 'QK': 53, 'QL': 113, 'QM': 101, 'QN': 46, 'QP': 76, 'QQ': 0, 'QR': 43, 'QS': 68, 'QT': 42, 'QV': 96, 'QW': 130, 'QY': 99, 'RA': 112, 'RC': 180, 'RD': 96, 'RE': 54, 'RF': 97, 'RG': 125, 'RH': 29, 'RI': 97, 'RK': 26, 'RL': 102, 'RM': 91, 'RN': 86, 'RP': 103, 'RQ': 43, 'RR': 0, 'RS': 110, 'RT': 71, 'RV': 96, 'RW': 101, 'RY': 77, 'SA': 99, 'SC': 112, 'SD': 65, 'SE': 80, 'SF': 155, 'SG': 56, 'SH': 89, 'SI': 142, 'SK': 121, 'SL': 145, 'SM': 135, 'SN': 46, 'SP': 74, 'SQ': 68, 'SR': 110, 'SS': 0, 'ST': 58, 'SV': 124, 'SW': 177, 'SY': 144, 'TA': 58, 'TC': 149, 'TD': 85, 'TE': 65, 'TF': 103, 'TG': 59, 'TH': 47, 'TI': 89, 'TK': 78, 'TL': 92, 'TM': 81, 'TN': 65, 'TP': 38, 'TQ': 42, 'TR': 71, 'TS': 58, 'TT': 0, 'TV': 69, 'TW': 128, 'TY': 92, 'VA': 64, 'VC': 192, 'VD': 152, 'VE': 121, 'VF': 50, 'VG': 109, 'VH': 84, 'VI': 29, 'VK': 97, 'VL': 32, 'VM': 21, 'VN': 133, 'VP': 68, 'VQ': 96, 'VR': 96, 'VS': 124, 'VT': 69, 'VV': 0, 'VW': 88, 'VY': 55, 'WA': 148, 'WC': 215, 'WD': 181, 'WE': 152, 'WF': 40, 'WG': 184, 'WH': 115, 'WI': 61, 'WK': 110, 'WL': 61, 'WM': 67, 'WN': 174, 'WP': 147, 'WQ': 130, 'WR': 101, 'WS': 177, 'WT': 128, 'WV': 88, 'WW': 0, 'WY': 37, 'YA': 112, 'YC': 194, 'YD': 160, 'YE': 122, 'YF': 22, 'YG': 147, 'YH': 83, 'YI': 33, 'YK': 85, 'YL': 36, 'YM': 36, 'YN': 143, 'YP': 110, 'YQ': 99, 'YR': 77, 'YS': 144, 'YT': 92, 'YV': 55, 'YW': 37, 'YY': 0})[source]

Compute the last maxlag quasi-sequence-order descriptors for a given protein sequence.

Parameters:
  • ProteinSequence (str) – a pure protein sequence
  • maxlag (int, optional (default: 30)) – the maximum lag and the length of the protein should be larger

Examples

>>> from propy.GetProteinFromUniprot import GetProteinSequence
>>> protein = GetProteinSequence(ProteinID="Q9NQ39")
>>> result = GetQuasiSequenceOrder2Grant(protein)

see GetQuasiSequenceOrder() for the choice of parameters.

propy.QuasiSequenceOrder.GetQuasiSequenceOrder2SW(ProteinSequence: str, maxlag=30, weight=0.1, distancematrix={'AA': 0.0, 'AC': 0.112, 'AD': 0.819, 'AE': 0.827, 'AF': 0.54, 'AG': 0.208, 'AH': 0.696, 'AI': 0.407, 'AK': 0.891, 'AL': 0.406, 'AM': 0.379, 'AN': 0.318, 'AP': 0.191, 'AQ': 0.372, 'AR': 1.0, 'AS': 0.094, 'AT': 0.22, 'AV': 0.273, 'AW': 0.739, 'AY': 0.552, 'CA': 0.114, 'CC': 0.0, 'CD': 0.847, 'CE': 0.838, 'CF': 0.437, 'CG': 0.32, 'CH': 0.66, 'CI': 0.304, 'CK': 0.887, 'CL': 0.301, 'CM': 0.277, 'CN': 0.324, 'CP': 0.157, 'CQ': 0.341, 'CR': 1.0, 'CS': 0.176, 'CT': 0.233, 'CV': 0.167, 'CW': 0.639, 'CY': 0.457, 'DA': 0.729, 'DC': 0.742, 'DD': 0.0, 'DE': 0.124, 'DF': 0.924, 'DG': 0.697, 'DH': 0.435, 'DI': 0.847, 'DK': 0.249, 'DL': 0.841, 'DM': 0.819, 'DN': 0.56, 'DP': 0.657, 'DQ': 0.584, 'DR': 0.295, 'DS': 0.667, 'DT': 0.649, 'DV': 0.797, 'DW': 1.0, 'DY': 0.836, 'EA': 0.79, 'EC': 0.788, 'ED': 0.133, 'EE': 0.0, 'EF': 0.932, 'EG': 0.779, 'EH': 0.406, 'EI': 0.86, 'EK': 0.143, 'EL': 0.854, 'EM': 0.83, 'EN': 0.599, 'EP': 0.688, 'EQ': 0.598, 'ER': 0.234, 'ES': 0.726, 'ET': 0.682, 'EV': 0.824, 'EW': 1.0, 'EY': 0.837, 'FA': 0.508, 'FC': 0.405, 'FD': 0.977, 'FE': 0.918, 'FF': 0.0, 'FG': 0.69, 'FH': 0.663, 'FI': 0.128, 'FK': 0.903, 'FL': 0.131, 'FM': 0.169, 'FN': 0.541, 'FP': 0.42, 'FQ': 0.459, 'FR': 1.0, 'FS': 0.548, 'FT': 0.499, 'FV': 0.252, 'FW': 0.207, 'FY': 0.179, 'GA': 0.206, 'GC': 0.312, 'GD': 0.776, 'GE': 0.807, 'GF': 0.727, 'GG': 0.0, 'GH': 0.769, 'GI': 0.592, 'GK': 0.894, 'GL': 0.591, 'GM': 0.557, 'GN': 0.381, 'GP': 0.323, 'GQ': 0.467, 'GR': 1.0, 'GS': 0.158, 'GT': 0.272, 'GV': 0.464, 'GW': 0.923, 'GY': 0.728, 'HA': 0.896, 'HC': 0.836, 'HD': 0.629, 'HE': 0.547, 'HF': 0.907, 'HG': 1.0, 'HH': 0.0, 'HI': 0.848, 'HK': 0.566, 'HL': 0.842, 'HM': 0.825, 'HN': 0.754, 'HP': 0.777, 'HQ': 0.716, 'HR': 0.697, 'HS': 0.865, 'HT': 0.834, 'HV': 0.831, 'HW': 0.981, 'HY': 0.821, 'IA': 0.403, 'IC': 0.296, 'ID': 0.942, 'IE': 0.891, 'IF': 0.134, 'IG': 0.592, 'IH': 0.652, 'II': 0.0, 'IK': 0.892, 'IL': 0.013, 'IM': 0.057, 'IN': 0.457, 'IP': 0.311, 'IQ': 0.383, 'IR': 1.0, 'IS': 0.443, 'IT': 0.396, 'IV': 0.133, 'IW': 0.339, 'IY': 0.213, 'KA': 0.889, 'KC': 0.871, 'KD': 0.279, 'KE': 0.149, 'KF': 0.957, 'KG': 0.9, 'KH': 0.438, 'KI': 0.899, 'KK': 0.0, 'KL': 0.892, 'KM': 0.871, 'KN': 0.667, 'KP': 0.757, 'KQ': 0.639, 'KR': 0.154, 'KS': 0.825, 'KT': 0.759, 'KV': 0.882, 'KW': 1.0, 'KY': 0.848, 'LA': 0.405, 'LC': 0.296, 'LD': 0.944, 'LE': 0.892, 'LF': 0.139, 'LG': 0.596, 'LH': 0.653, 'LI': 0.013, 'LK': 0.893, 'LL': 0.0, 'LM': 0.062, 'LN': 0.452, 'LP': 0.309, 'LQ': 0.376, 'LR': 1.0, 'LS': 0.443, 'LT': 0.397, 'LV': 0.133, 'LW': 0.341, 'LY': 0.205, 'MA': 0.383, 'MC': 0.276, 'MD': 0.932, 'ME': 0.879, 'MF': 0.182, 'MG': 0.569, 'MH': 0.648, 'MI': 0.058, 'MK': 0.884, 'ML': 0.062, 'MM': 0.0, 'MN': 0.447, 'MP': 0.285, 'MQ': 0.372, 'MR': 1.0, 'MS': 0.417, 'MT': 0.358, 'MV': 0.12, 'MW': 0.391, 'MY': 0.255, 'NA': 0.424, 'NC': 0.425, 'ND': 0.838, 'NE': 0.835, 'NF': 0.766, 'NG': 0.512, 'NH': 0.78, 'NI': 0.615, 'NK': 0.891, 'NL': 0.603, 'NM': 0.588, 'NN': 0.0, 'NP': 0.266, 'NQ': 0.175, 'NR': 1.0, 'NS': 0.361, 'NT': 0.368, 'NV': 0.503, 'NW': 0.945, 'NY': 0.641, 'PA': 0.22, 'PC': 0.179, 'PD': 0.852, 'PE': 0.831, 'PF': 0.515, 'PG': 0.376, 'PH': 0.696, 'PI': 0.363, 'PK': 0.875, 'PL': 0.357, 'PM': 0.326, 'PN': 0.231, 'PP': 0.0, 'PQ': 0.228, 'PR': 1.0, 'PS': 0.196, 'PT': 0.161, 'PV': 0.244, 'PW': 0.72, 'PY': 0.481, 'QA': 0.512, 'QC': 0.462, 'QD': 0.903, 'QE': 0.861, 'QF': 0.671, 'QG': 0.648, 'QH': 0.765, 'QI': 0.532, 'QK': 0.881, 'QL': 0.518, 'QM': 0.505, 'QN': 0.181, 'QP': 0.272, 'QQ': 0.0, 'QR': 1.0, 'QS': 0.461, 'QT': 0.389, 'QV': 0.464, 'QW': 0.831, 'QY': 0.522, 'RA': 0.919, 'RC': 0.905, 'RD': 0.305, 'RE': 0.225, 'RF': 0.977, 'RG': 0.928, 'RH': 0.498, 'RI': 0.929, 'RK': 0.141, 'RL': 0.92, 'RM': 0.908, 'RN': 0.69, 'RP': 0.796, 'RQ': 0.668, 'RR': 0.0, 'RS': 0.86, 'RT': 0.808, 'RV': 0.914, 'RW': 1.0, 'RY': 0.859, 'SA': 0.1, 'SC': 0.185, 'SD': 0.801, 'SE': 0.812, 'SF': 0.622, 'SG': 0.17, 'SH': 0.718, 'SI': 0.478, 'SK': 0.883, 'SL': 0.474, 'SM': 0.44, 'SN': 0.289, 'SP': 0.181, 'SQ': 0.358, 'SR': 1.0, 'SS': 0.0, 'ST': 0.174, 'SV': 0.342, 'SW': 0.827, 'SY': 0.615, 'TA': 0.251, 'TC': 0.261, 'TD': 0.83, 'TE': 0.812, 'TF': 0.604, 'TG': 0.312, 'TH': 0.737, 'TI': 0.455, 'TK': 0.866, 'TL': 0.453, 'TM': 0.403, 'TN': 0.315, 'TP': 0.159, 'TQ': 0.322, 'TR': 1.0, 'TS': 0.185, 'TT': 0.0, 'TV': 0.345, 'TW': 0.816, 'TY': 0.596, 'VA': 0.275, 'VC': 0.165, 'VD': 0.9, 'VE': 0.867, 'VF': 0.269, 'VG': 0.471, 'VH': 0.649, 'VI': 0.135, 'VK': 0.889, 'VL': 0.134, 'VM': 0.12, 'VN': 0.38, 'VP': 0.212, 'VQ': 0.339, 'VR': 1.0, 'VS': 0.322, 'VT': 0.305, 'VV': 0.0, 'VW': 0.472, 'VY': 0.31, 'WA': 0.658, 'WC': 0.56, 'WD': 1.0, 'WE': 0.931, 'WF': 0.196, 'WG': 0.829, 'WH': 0.678, 'WI': 0.305, 'WK': 0.892, 'WL': 0.304, 'WM': 0.344, 'WN': 0.631, 'WP': 0.555, 'WQ': 0.538, 'WR': 0.968, 'WS': 0.689, 'WT': 0.638, 'WV': 0.418, 'WW': 0.0, 'WY': 0.204, 'YA': 0.587, 'YC': 0.478, 'YD': 1.0, 'YE': 0.932, 'YF': 0.202, 'YG': 0.782, 'YH': 0.678, 'YI': 0.23, 'YK': 0.904, 'YL': 0.219, 'YM': 0.268, 'YN': 0.512, 'YP': 0.444, 'YQ': 0.404, 'YR': 0.995, 'YS': 0.612, 'YT': 0.557, 'YV': 0.328, 'YW': 0.244, 'YY': 0.0})[source]

Compute the last maxlag quasi-sequence-order descriptors for a given protein sequence.

Parameters:
  • ProteinSequence (str) – a pure protein sequence
  • maxlag (int, optional (default: 30)) – the maximum lag and the length of the protein should be larger

Examples

>>> from propy.GetProteinFromUniprot import GetProteinSequence
>>> protein = GetProteinSequence(ProteinID="Q9NQ39")
>>> result = GetQuasiSequenceOrder2SW(protein)

see GetQuasiSequenceOrder() for the choice of parameters.

propy.QuasiSequenceOrder.GetQuasiSequenceOrderp(ProteinSequence: str, maxlag: int = 30, weight: float = 0.1, distancematrix: Dict[Any, Any] = None) → Dict[Any, Any][source]

Compute quasi-sequence-order descriptors for a given protein.

See [1] for details.

Parameters:
  • ProteinSequence (str) – a pure protein sequence
  • maxlag (int, optional (default: 30)) – the maximum lag and the length of the protein should be larger than maxlag
  • weight (float, optional (default: 0.1)) – a weight factor. Please see reference 1 for its choice.
  • distancematrix (Dict[Any, Any]) – contains 400 distance values
Returns:

result – contains all quasi-sequence-order descriptors

Return type:

Dict[Any, Any]

Examples

>>> from propy.GetProteinFromUniprot import GetProteinSequence
>>> protein = GetProteinSequence(ProteinID="Q9NQ39")
>>> result = GetQuasiSequenceOrderp(protein)
propy.QuasiSequenceOrder.GetSequenceOrderCouplingNumber(ProteinSequence: str, d: int = 1, distancematrix: Dict[str, float] = {'AA': 0.0, 'AC': 0.112, 'AD': 0.819, 'AE': 0.827, 'AF': 0.54, 'AG': 0.208, 'AH': 0.696, 'AI': 0.407, 'AK': 0.891, 'AL': 0.406, 'AM': 0.379, 'AN': 0.318, 'AP': 0.191, 'AQ': 0.372, 'AR': 1.0, 'AS': 0.094, 'AT': 0.22, 'AV': 0.273, 'AW': 0.739, 'AY': 0.552, 'CA': 0.114, 'CC': 0.0, 'CD': 0.847, 'CE': 0.838, 'CF': 0.437, 'CG': 0.32, 'CH': 0.66, 'CI': 0.304, 'CK': 0.887, 'CL': 0.301, 'CM': 0.277, 'CN': 0.324, 'CP': 0.157, 'CQ': 0.341, 'CR': 1.0, 'CS': 0.176, 'CT': 0.233, 'CV': 0.167, 'CW': 0.639, 'CY': 0.457, 'DA': 0.729, 'DC': 0.742, 'DD': 0.0, 'DE': 0.124, 'DF': 0.924, 'DG': 0.697, 'DH': 0.435, 'DI': 0.847, 'DK': 0.249, 'DL': 0.841, 'DM': 0.819, 'DN': 0.56, 'DP': 0.657, 'DQ': 0.584, 'DR': 0.295, 'DS': 0.667, 'DT': 0.649, 'DV': 0.797, 'DW': 1.0, 'DY': 0.836, 'EA': 0.79, 'EC': 0.788, 'ED': 0.133, 'EE': 0.0, 'EF': 0.932, 'EG': 0.779, 'EH': 0.406, 'EI': 0.86, 'EK': 0.143, 'EL': 0.854, 'EM': 0.83, 'EN': 0.599, 'EP': 0.688, 'EQ': 0.598, 'ER': 0.234, 'ES': 0.726, 'ET': 0.682, 'EV': 0.824, 'EW': 1.0, 'EY': 0.837, 'FA': 0.508, 'FC': 0.405, 'FD': 0.977, 'FE': 0.918, 'FF': 0.0, 'FG': 0.69, 'FH': 0.663, 'FI': 0.128, 'FK': 0.903, 'FL': 0.131, 'FM': 0.169, 'FN': 0.541, 'FP': 0.42, 'FQ': 0.459, 'FR': 1.0, 'FS': 0.548, 'FT': 0.499, 'FV': 0.252, 'FW': 0.207, 'FY': 0.179, 'GA': 0.206, 'GC': 0.312, 'GD': 0.776, 'GE': 0.807, 'GF': 0.727, 'GG': 0.0, 'GH': 0.769, 'GI': 0.592, 'GK': 0.894, 'GL': 0.591, 'GM': 0.557, 'GN': 0.381, 'GP': 0.323, 'GQ': 0.467, 'GR': 1.0, 'GS': 0.158, 'GT': 0.272, 'GV': 0.464, 'GW': 0.923, 'GY': 0.728, 'HA': 0.896, 'HC': 0.836, 'HD': 0.629, 'HE': 0.547, 'HF': 0.907, 'HG': 1.0, 'HH': 0.0, 'HI': 0.848, 'HK': 0.566, 'HL': 0.842, 'HM': 0.825, 'HN': 0.754, 'HP': 0.777, 'HQ': 0.716, 'HR': 0.697, 'HS': 0.865, 'HT': 0.834, 'HV': 0.831, 'HW': 0.981, 'HY': 0.821, 'IA': 0.403, 'IC': 0.296, 'ID': 0.942, 'IE': 0.891, 'IF': 0.134, 'IG': 0.592, 'IH': 0.652, 'II': 0.0, 'IK': 0.892, 'IL': 0.013, 'IM': 0.057, 'IN': 0.457, 'IP': 0.311, 'IQ': 0.383, 'IR': 1.0, 'IS': 0.443, 'IT': 0.396, 'IV': 0.133, 'IW': 0.339, 'IY': 0.213, 'KA': 0.889, 'KC': 0.871, 'KD': 0.279, 'KE': 0.149, 'KF': 0.957, 'KG': 0.9, 'KH': 0.438, 'KI': 0.899, 'KK': 0.0, 'KL': 0.892, 'KM': 0.871, 'KN': 0.667, 'KP': 0.757, 'KQ': 0.639, 'KR': 0.154, 'KS': 0.825, 'KT': 0.759, 'KV': 0.882, 'KW': 1.0, 'KY': 0.848, 'LA': 0.405, 'LC': 0.296, 'LD': 0.944, 'LE': 0.892, 'LF': 0.139, 'LG': 0.596, 'LH': 0.653, 'LI': 0.013, 'LK': 0.893, 'LL': 0.0, 'LM': 0.062, 'LN': 0.452, 'LP': 0.309, 'LQ': 0.376, 'LR': 1.0, 'LS': 0.443, 'LT': 0.397, 'LV': 0.133, 'LW': 0.341, 'LY': 0.205, 'MA': 0.383, 'MC': 0.276, 'MD': 0.932, 'ME': 0.879, 'MF': 0.182, 'MG': 0.569, 'MH': 0.648, 'MI': 0.058, 'MK': 0.884, 'ML': 0.062, 'MM': 0.0, 'MN': 0.447, 'MP': 0.285, 'MQ': 0.372, 'MR': 1.0, 'MS': 0.417, 'MT': 0.358, 'MV': 0.12, 'MW': 0.391, 'MY': 0.255, 'NA': 0.424, 'NC': 0.425, 'ND': 0.838, 'NE': 0.835, 'NF': 0.766, 'NG': 0.512, 'NH': 0.78, 'NI': 0.615, 'NK': 0.891, 'NL': 0.603, 'NM': 0.588, 'NN': 0.0, 'NP': 0.266, 'NQ': 0.175, 'NR': 1.0, 'NS': 0.361, 'NT': 0.368, 'NV': 0.503, 'NW': 0.945, 'NY': 0.641, 'PA': 0.22, 'PC': 0.179, 'PD': 0.852, 'PE': 0.831, 'PF': 0.515, 'PG': 0.376, 'PH': 0.696, 'PI': 0.363, 'PK': 0.875, 'PL': 0.357, 'PM': 0.326, 'PN': 0.231, 'PP': 0.0, 'PQ': 0.228, 'PR': 1.0, 'PS': 0.196, 'PT': 0.161, 'PV': 0.244, 'PW': 0.72, 'PY': 0.481, 'QA': 0.512, 'QC': 0.462, 'QD': 0.903, 'QE': 0.861, 'QF': 0.671, 'QG': 0.648, 'QH': 0.765, 'QI': 0.532, 'QK': 0.881, 'QL': 0.518, 'QM': 0.505, 'QN': 0.181, 'QP': 0.272, 'QQ': 0.0, 'QR': 1.0, 'QS': 0.461, 'QT': 0.389, 'QV': 0.464, 'QW': 0.831, 'QY': 0.522, 'RA': 0.919, 'RC': 0.905, 'RD': 0.305, 'RE': 0.225, 'RF': 0.977, 'RG': 0.928, 'RH': 0.498, 'RI': 0.929, 'RK': 0.141, 'RL': 0.92, 'RM': 0.908, 'RN': 0.69, 'RP': 0.796, 'RQ': 0.668, 'RR': 0.0, 'RS': 0.86, 'RT': 0.808, 'RV': 0.914, 'RW': 1.0, 'RY': 0.859, 'SA': 0.1, 'SC': 0.185, 'SD': 0.801, 'SE': 0.812, 'SF': 0.622, 'SG': 0.17, 'SH': 0.718, 'SI': 0.478, 'SK': 0.883, 'SL': 0.474, 'SM': 0.44, 'SN': 0.289, 'SP': 0.181, 'SQ': 0.358, 'SR': 1.0, 'SS': 0.0, 'ST': 0.174, 'SV': 0.342, 'SW': 0.827, 'SY': 0.615, 'TA': 0.251, 'TC': 0.261, 'TD': 0.83, 'TE': 0.812, 'TF': 0.604, 'TG': 0.312, 'TH': 0.737, 'TI': 0.455, 'TK': 0.866, 'TL': 0.453, 'TM': 0.403, 'TN': 0.315, 'TP': 0.159, 'TQ': 0.322, 'TR': 1.0, 'TS': 0.185, 'TT': 0.0, 'TV': 0.345, 'TW': 0.816, 'TY': 0.596, 'VA': 0.275, 'VC': 0.165, 'VD': 0.9, 'VE': 0.867, 'VF': 0.269, 'VG': 0.471, 'VH': 0.649, 'VI': 0.135, 'VK': 0.889, 'VL': 0.134, 'VM': 0.12, 'VN': 0.38, 'VP': 0.212, 'VQ': 0.339, 'VR': 1.0, 'VS': 0.322, 'VT': 0.305, 'VV': 0.0, 'VW': 0.472, 'VY': 0.31, 'WA': 0.658, 'WC': 0.56, 'WD': 1.0, 'WE': 0.931, 'WF': 0.196, 'WG': 0.829, 'WH': 0.678, 'WI': 0.305, 'WK': 0.892, 'WL': 0.304, 'WM': 0.344, 'WN': 0.631, 'WP': 0.555, 'WQ': 0.538, 'WR': 0.968, 'WS': 0.689, 'WT': 0.638, 'WV': 0.418, 'WW': 0.0, 'WY': 0.204, 'YA': 0.587, 'YC': 0.478, 'YD': 1.0, 'YE': 0.932, 'YF': 0.202, 'YG': 0.782, 'YH': 0.678, 'YI': 0.23, 'YK': 0.904, 'YL': 0.219, 'YM': 0.268, 'YN': 0.512, 'YP': 0.444, 'YQ': 0.404, 'YR': 0.995, 'YS': 0.612, 'YT': 0.557, 'YV': 0.328, 'YW': 0.244, 'YY': 0.0})[source]

Compute the dth-rank sequence order coupling number for a protein.

Parameters:
  • ProteinSequence (str) – a pure protein sequence
  • d (int) – the gap between two amino acids.
  • distancematrix (Dict[str, float]) –
Returns:

tau

Return type:

float

Example

>>> from propy.GetProteinFromUniprot import GetProteinSequence
>>> protein = GetProteinSequence(ProteinID="Q9NQ39")
>>> result = GetSequenceOrderCouplingNumber(protein)
propy.QuasiSequenceOrder.GetSequenceOrderCouplingNumberGrant(ProteinSequence: str, maxlag: int = 30, distancematrix={'AA': 0, 'AC': 195, 'AD': 126, 'AE': 107, 'AF': 113, 'AG': 60, 'AH': 86, 'AI': 94, 'AK': 106, 'AL': 96, 'AM': 84, 'AN': 111, 'AP': 27, 'AQ': 91, 'AR': 112, 'AS': 99, 'AT': 58, 'AV': 64, 'AW': 148, 'AY': 112, 'CA': 195, 'CC': 0, 'CD': 154, 'CE': 170, 'CF': 205, 'CG': 159, 'CH': 174, 'CI': 198, 'CK': 202, 'CL': 198, 'CM': 196, 'CN': 139, 'CP': 169, 'CQ': 154, 'CR': 180, 'CS': 112, 'CT': 149, 'CV': 192, 'CW': 215, 'CY': 194, 'DA': 126, 'DC': 154, 'DD': 0, 'DE': 45, 'DF': 177, 'DG': 94, 'DH': 81, 'DI': 168, 'DK': 101, 'DL': 172, 'DM': 160, 'DN': 23, 'DP': 108, 'DQ': 61, 'DR': 96, 'DS': 65, 'DT': 85, 'DV': 152, 'DW': 181, 'DY': 160, 'EA': 107, 'EC': 170, 'ED': 45, 'EE': 0, 'EF': 140, 'EG': 98, 'EH': 40, 'EI': 134, 'EK': 56, 'EL': 138, 'EM': 126, 'EN': 42, 'EP': 93, 'EQ': 29, 'ER': 54, 'ES': 80, 'ET': 65, 'EV': 121, 'EW': 152, 'EY': 122, 'FA': 113, 'FC': 205, 'FD': 177, 'FE': 140, 'FF': 0, 'FG': 153, 'FH': 100, 'FI': 21, 'FK': 102, 'FL': 22, 'FM': 28, 'FN': 158, 'FP': 114, 'FQ': 116, 'FR': 97, 'FS': 155, 'FT': 103, 'FV': 50, 'FW': 40, 'FY': 22, 'GA': 60, 'GC': 159, 'GD': 94, 'GE': 98, 'GF': 153, 'GG': 0, 'GH': 98, 'GI': 135, 'GK': 127, 'GL': 138, 'GM': 127, 'GN': 80, 'GP': 42, 'GQ': 87, 'GR': 125, 'GS': 56, 'GT': 59, 'GV': 109, 'GW': 184, 'GY': 147, 'HA': 86, 'HC': 174, 'HD': 81, 'HE': 40, 'HF': 100, 'HG': 98, 'HH': 0, 'HI': 94, 'HK': 32, 'HL': 99, 'HM': 87, 'HN': 68, 'HP': 77, 'HQ': 24, 'HR': 29, 'HS': 89, 'HT': 47, 'HV': 84, 'HW': 115, 'HY': 83, 'IA': 94, 'IC': 198, 'ID': 168, 'IE': 134, 'IF': 21, 'IG': 135, 'IH': 94, 'II': 0, 'IK': 102, 'IL': 5, 'IM': 10, 'IN': 149, 'IP': 95, 'IQ': 109, 'IR': 97, 'IS': 142, 'IT': 89, 'IV': 29, 'IW': 61, 'IY': 33, 'KA': 106, 'KC': 202, 'KD': 101, 'KE': 56, 'KF': 102, 'KG': 127, 'KH': 32, 'KI': 102, 'KK': 0, 'KL': 107, 'KM': 95, 'KN': 94, 'KP': 103, 'KQ': 53, 'KR': 26, 'KS': 121, 'KT': 78, 'KV': 97, 'KW': 110, 'KY': 85, 'LA': 96, 'LC': 198, 'LD': 172, 'LE': 138, 'LF': 22, 'LG': 138, 'LH': 99, 'LI': 5, 'LK': 107, 'LL': 0, 'LM': 15, 'LN': 153, 'LP': 98, 'LQ': 113, 'LR': 102, 'LS': 145, 'LT': 92, 'LV': 32, 'LW': 61, 'LY': 36, 'MA': 84, 'MC': 196, 'MD': 160, 'ME': 126, 'MF': 28, 'MG': 127, 'MH': 87, 'MI': 10, 'MK': 95, 'ML': 15, 'MM': 0, 'MN': 142, 'MP': 87, 'MQ': 101, 'MR': 91, 'MS': 135, 'MT': 81, 'MV': 21, 'MW': 67, 'MY': 36, 'NA': 111, 'NC': 139, 'ND': 23, 'NE': 42, 'NF': 158, 'NG': 80, 'NH': 68, 'NI': 149, 'NK': 94, 'NL': 153, 'NM': 142, 'NN': 0, 'NP': 91, 'NQ': 46, 'NR': 86, 'NS': 46, 'NT': 65, 'NV': 133, 'NW': 174, 'NY': 143, 'PA': 27, 'PC': 169, 'PD': 108, 'PE': 93, 'PF': 114, 'PG': 42, 'PH': 77, 'PI': 95, 'PK': 103, 'PL': 98, 'PM': 87, 'PN': 91, 'PP': 0, 'PQ': 76, 'PR': 103, 'PS': 74, 'PT': 38, 'PV': 68, 'PW': 147, 'PY': 110, 'QA': 91, 'QC': 154, 'QD': 61, 'QE': 29, 'QF': 116, 'QG': 87, 'QH': 24, 'QI': 109, 'QK': 53, 'QL': 113, 'QM': 101, 'QN': 46, 'QP': 76, 'QQ': 0, 'QR': 43, 'QS': 68, 'QT': 42, 'QV': 96, 'QW': 130, 'QY': 99, 'RA': 112, 'RC': 180, 'RD': 96, 'RE': 54, 'RF': 97, 'RG': 125, 'RH': 29, 'RI': 97, 'RK': 26, 'RL': 102, 'RM': 91, 'RN': 86, 'RP': 103, 'RQ': 43, 'RR': 0, 'RS': 110, 'RT': 71, 'RV': 96, 'RW': 101, 'RY': 77, 'SA': 99, 'SC': 112, 'SD': 65, 'SE': 80, 'SF': 155, 'SG': 56, 'SH': 89, 'SI': 142, 'SK': 121, 'SL': 145, 'SM': 135, 'SN': 46, 'SP': 74, 'SQ': 68, 'SR': 110, 'SS': 0, 'ST': 58, 'SV': 124, 'SW': 177, 'SY': 144, 'TA': 58, 'TC': 149, 'TD': 85, 'TE': 65, 'TF': 103, 'TG': 59, 'TH': 47, 'TI': 89, 'TK': 78, 'TL': 92, 'TM': 81, 'TN': 65, 'TP': 38, 'TQ': 42, 'TR': 71, 'TS': 58, 'TT': 0, 'TV': 69, 'TW': 128, 'TY': 92, 'VA': 64, 'VC': 192, 'VD': 152, 'VE': 121, 'VF': 50, 'VG': 109, 'VH': 84, 'VI': 29, 'VK': 97, 'VL': 32, 'VM': 21, 'VN': 133, 'VP': 68, 'VQ': 96, 'VR': 96, 'VS': 124, 'VT': 69, 'VV': 0, 'VW': 88, 'VY': 55, 'WA': 148, 'WC': 215, 'WD': 181, 'WE': 152, 'WF': 40, 'WG': 184, 'WH': 115, 'WI': 61, 'WK': 110, 'WL': 61, 'WM': 67, 'WN': 174, 'WP': 147, 'WQ': 130, 'WR': 101, 'WS': 177, 'WT': 128, 'WV': 88, 'WW': 0, 'WY': 37, 'YA': 112, 'YC': 194, 'YD': 160, 'YE': 122, 'YF': 22, 'YG': 147, 'YH': 83, 'YI': 33, 'YK': 85, 'YL': 36, 'YM': 36, 'YN': 143, 'YP': 110, 'YQ': 99, 'YR': 77, 'YS': 144, 'YT': 92, 'YV': 55, 'YW': 37, 'YY': 0})[source]

Compute the sequence order coupling numbers from 1 to maxlag for a given protein sequence based on the Grantham chemical distance matrix.

Parameters:
  • ProteinSequence (str) – a pure protein sequence
  • maxlag (int, optional (default: 30)) – the maximum lag and the length of the protein should be larger than maxlag
  • distancematrix (Dict[Any, Any]) – contains Schneider-Wrede physicochemical distance matrix
Returns:

Tau – contains all sequence order coupling numbers based on the Grantham chemical distance matrix

Return type:

Dict[Any, Any]

Examples

>>> from propy.GetProteinFromUniprot import GetProteinSequence
>>> protein = GetProteinSequence(ProteinID="Q9NQ39")
>>> result = GetSequenceOrderCouplingNumberGrant(protein)
propy.QuasiSequenceOrder.GetSequenceOrderCouplingNumberSW(ProteinSequence: str, maxlag: int = 30, distancematrix={'AA': 0.0, 'AC': 0.112, 'AD': 0.819, 'AE': 0.827, 'AF': 0.54, 'AG': 0.208, 'AH': 0.696, 'AI': 0.407, 'AK': 0.891, 'AL': 0.406, 'AM': 0.379, 'AN': 0.318, 'AP': 0.191, 'AQ': 0.372, 'AR': 1.0, 'AS': 0.094, 'AT': 0.22, 'AV': 0.273, 'AW': 0.739, 'AY': 0.552, 'CA': 0.114, 'CC': 0.0, 'CD': 0.847, 'CE': 0.838, 'CF': 0.437, 'CG': 0.32, 'CH': 0.66, 'CI': 0.304, 'CK': 0.887, 'CL': 0.301, 'CM': 0.277, 'CN': 0.324, 'CP': 0.157, 'CQ': 0.341, 'CR': 1.0, 'CS': 0.176, 'CT': 0.233, 'CV': 0.167, 'CW': 0.639, 'CY': 0.457, 'DA': 0.729, 'DC': 0.742, 'DD': 0.0, 'DE': 0.124, 'DF': 0.924, 'DG': 0.697, 'DH': 0.435, 'DI': 0.847, 'DK': 0.249, 'DL': 0.841, 'DM': 0.819, 'DN': 0.56, 'DP': 0.657, 'DQ': 0.584, 'DR': 0.295, 'DS': 0.667, 'DT': 0.649, 'DV': 0.797, 'DW': 1.0, 'DY': 0.836, 'EA': 0.79, 'EC': 0.788, 'ED': 0.133, 'EE': 0.0, 'EF': 0.932, 'EG': 0.779, 'EH': 0.406, 'EI': 0.86, 'EK': 0.143, 'EL': 0.854, 'EM': 0.83, 'EN': 0.599, 'EP': 0.688, 'EQ': 0.598, 'ER': 0.234, 'ES': 0.726, 'ET': 0.682, 'EV': 0.824, 'EW': 1.0, 'EY': 0.837, 'FA': 0.508, 'FC': 0.405, 'FD': 0.977, 'FE': 0.918, 'FF': 0.0, 'FG': 0.69, 'FH': 0.663, 'FI': 0.128, 'FK': 0.903, 'FL': 0.131, 'FM': 0.169, 'FN': 0.541, 'FP': 0.42, 'FQ': 0.459, 'FR': 1.0, 'FS': 0.548, 'FT': 0.499, 'FV': 0.252, 'FW': 0.207, 'FY': 0.179, 'GA': 0.206, 'GC': 0.312, 'GD': 0.776, 'GE': 0.807, 'GF': 0.727, 'GG': 0.0, 'GH': 0.769, 'GI': 0.592, 'GK': 0.894, 'GL': 0.591, 'GM': 0.557, 'GN': 0.381, 'GP': 0.323, 'GQ': 0.467, 'GR': 1.0, 'GS': 0.158, 'GT': 0.272, 'GV': 0.464, 'GW': 0.923, 'GY': 0.728, 'HA': 0.896, 'HC': 0.836, 'HD': 0.629, 'HE': 0.547, 'HF': 0.907, 'HG': 1.0, 'HH': 0.0, 'HI': 0.848, 'HK': 0.566, 'HL': 0.842, 'HM': 0.825, 'HN': 0.754, 'HP': 0.777, 'HQ': 0.716, 'HR': 0.697, 'HS': 0.865, 'HT': 0.834, 'HV': 0.831, 'HW': 0.981, 'HY': 0.821, 'IA': 0.403, 'IC': 0.296, 'ID': 0.942, 'IE': 0.891, 'IF': 0.134, 'IG': 0.592, 'IH': 0.652, 'II': 0.0, 'IK': 0.892, 'IL': 0.013, 'IM': 0.057, 'IN': 0.457, 'IP': 0.311, 'IQ': 0.383, 'IR': 1.0, 'IS': 0.443, 'IT': 0.396, 'IV': 0.133, 'IW': 0.339, 'IY': 0.213, 'KA': 0.889, 'KC': 0.871, 'KD': 0.279, 'KE': 0.149, 'KF': 0.957, 'KG': 0.9, 'KH': 0.438, 'KI': 0.899, 'KK': 0.0, 'KL': 0.892, 'KM': 0.871, 'KN': 0.667, 'KP': 0.757, 'KQ': 0.639, 'KR': 0.154, 'KS': 0.825, 'KT': 0.759, 'KV': 0.882, 'KW': 1.0, 'KY': 0.848, 'LA': 0.405, 'LC': 0.296, 'LD': 0.944, 'LE': 0.892, 'LF': 0.139, 'LG': 0.596, 'LH': 0.653, 'LI': 0.013, 'LK': 0.893, 'LL': 0.0, 'LM': 0.062, 'LN': 0.452, 'LP': 0.309, 'LQ': 0.376, 'LR': 1.0, 'LS': 0.443, 'LT': 0.397, 'LV': 0.133, 'LW': 0.341, 'LY': 0.205, 'MA': 0.383, 'MC': 0.276, 'MD': 0.932, 'ME': 0.879, 'MF': 0.182, 'MG': 0.569, 'MH': 0.648, 'MI': 0.058, 'MK': 0.884, 'ML': 0.062, 'MM': 0.0, 'MN': 0.447, 'MP': 0.285, 'MQ': 0.372, 'MR': 1.0, 'MS': 0.417, 'MT': 0.358, 'MV': 0.12, 'MW': 0.391, 'MY': 0.255, 'NA': 0.424, 'NC': 0.425, 'ND': 0.838, 'NE': 0.835, 'NF': 0.766, 'NG': 0.512, 'NH': 0.78, 'NI': 0.615, 'NK': 0.891, 'NL': 0.603, 'NM': 0.588, 'NN': 0.0, 'NP': 0.266, 'NQ': 0.175, 'NR': 1.0, 'NS': 0.361, 'NT': 0.368, 'NV': 0.503, 'NW': 0.945, 'NY': 0.641, 'PA': 0.22, 'PC': 0.179, 'PD': 0.852, 'PE': 0.831, 'PF': 0.515, 'PG': 0.376, 'PH': 0.696, 'PI': 0.363, 'PK': 0.875, 'PL': 0.357, 'PM': 0.326, 'PN': 0.231, 'PP': 0.0, 'PQ': 0.228, 'PR': 1.0, 'PS': 0.196, 'PT': 0.161, 'PV': 0.244, 'PW': 0.72, 'PY': 0.481, 'QA': 0.512, 'QC': 0.462, 'QD': 0.903, 'QE': 0.861, 'QF': 0.671, 'QG': 0.648, 'QH': 0.765, 'QI': 0.532, 'QK': 0.881, 'QL': 0.518, 'QM': 0.505, 'QN': 0.181, 'QP': 0.272, 'QQ': 0.0, 'QR': 1.0, 'QS': 0.461, 'QT': 0.389, 'QV': 0.464, 'QW': 0.831, 'QY': 0.522, 'RA': 0.919, 'RC': 0.905, 'RD': 0.305, 'RE': 0.225, 'RF': 0.977, 'RG': 0.928, 'RH': 0.498, 'RI': 0.929, 'RK': 0.141, 'RL': 0.92, 'RM': 0.908, 'RN': 0.69, 'RP': 0.796, 'RQ': 0.668, 'RR': 0.0, 'RS': 0.86, 'RT': 0.808, 'RV': 0.914, 'RW': 1.0, 'RY': 0.859, 'SA': 0.1, 'SC': 0.185, 'SD': 0.801, 'SE': 0.812, 'SF': 0.622, 'SG': 0.17, 'SH': 0.718, 'SI': 0.478, 'SK': 0.883, 'SL': 0.474, 'SM': 0.44, 'SN': 0.289, 'SP': 0.181, 'SQ': 0.358, 'SR': 1.0, 'SS': 0.0, 'ST': 0.174, 'SV': 0.342, 'SW': 0.827, 'SY': 0.615, 'TA': 0.251, 'TC': 0.261, 'TD': 0.83, 'TE': 0.812, 'TF': 0.604, 'TG': 0.312, 'TH': 0.737, 'TI': 0.455, 'TK': 0.866, 'TL': 0.453, 'TM': 0.403, 'TN': 0.315, 'TP': 0.159, 'TQ': 0.322, 'TR': 1.0, 'TS': 0.185, 'TT': 0.0, 'TV': 0.345, 'TW': 0.816, 'TY': 0.596, 'VA': 0.275, 'VC': 0.165, 'VD': 0.9, 'VE': 0.867, 'VF': 0.269, 'VG': 0.471, 'VH': 0.649, 'VI': 0.135, 'VK': 0.889, 'VL': 0.134, 'VM': 0.12, 'VN': 0.38, 'VP': 0.212, 'VQ': 0.339, 'VR': 1.0, 'VS': 0.322, 'VT': 0.305, 'VV': 0.0, 'VW': 0.472, 'VY': 0.31, 'WA': 0.658, 'WC': 0.56, 'WD': 1.0, 'WE': 0.931, 'WF': 0.196, 'WG': 0.829, 'WH': 0.678, 'WI': 0.305, 'WK': 0.892, 'WL': 0.304, 'WM': 0.344, 'WN': 0.631, 'WP': 0.555, 'WQ': 0.538, 'WR': 0.968, 'WS': 0.689, 'WT': 0.638, 'WV': 0.418, 'WW': 0.0, 'WY': 0.204, 'YA': 0.587, 'YC': 0.478, 'YD': 1.0, 'YE': 0.932, 'YF': 0.202, 'YG': 0.782, 'YH': 0.678, 'YI': 0.23, 'YK': 0.904, 'YL': 0.219, 'YM': 0.268, 'YN': 0.512, 'YP': 0.444, 'YQ': 0.404, 'YR': 0.995, 'YS': 0.612, 'YT': 0.557, 'YV': 0.328, 'YW': 0.244, 'YY': 0.0})[source]

Compute the sequence order coupling numbers from 1 to maxlag for a given protein sequence based on the Schneider-Wrede physicochemical distance matrix.

Parameters:
  • ProteinSequence (str) – a pure protein sequence
  • maxlag (int, optional (default: 30)) – the maximum lag and the length of the protein should be larger than maxlag
  • distancematrix (Dict[Any, Any]) – contains Schneider-Wrede physicochemical distance matrix
Returns:

Tau – contains all sequence order coupling numbers based on the Schneider-Wrede physicochemical distance matrix

Return type:

Dict[Any, Any]

Examples

>>> from propy.GetProteinFromUniprot import GetProteinSequence
>>> protein = GetProteinSequence(ProteinID="Q9NQ39")
>>> result = GetSequenceOrderCouplingNumberSW(protein)
propy.QuasiSequenceOrder.GetSequenceOrderCouplingNumberTotal(ProteinSequence: str, maxlag: int = 30) → Dict[Any, Any][source]

Compute the sequence order coupling numbers from 1 to maxlag for a given protein sequence.

Parameters:
  • ProteinSequence (str) – a pure protein sequence
  • maxlag (int, optional (default: 30)) – the maximum lag and the length of the protein should be larger
Returns:

result – contains all sequence order coupling numbers

Return type:

Dict

Examples

>>> from propy.GetProteinFromUniprot import GetProteinSequence
>>> protein = GetProteinSequence(ProteinID="Q9NQ39")
>>> result = GetSequenceOrderCouplingNumberTotal(protein)
propy.QuasiSequenceOrder.GetSequenceOrderCouplingNumberp(ProteinSequence: str, maxlag: int = 30, distancematrix: Dict[Any, Any] = None)[source]

Compute the sequence order coupling numbers from 1 to maxlag for a given protein sequence based on the user-defined property.

Parameters:
  • ProteinSequence (str) – a pure protein sequence
  • maxlag (int, optional (default: 30)) – the maximum lag and the length of the protein should be larger than maxlag.
  • distancematrix (Dict[Any, Any]) – contains 400 distance values
Returns:

Tau – contains all sequence order coupling numbers based on the given property

Return type:

Dict[str]

Examples

>>> from propy.GetProteinFromUniprot import GetProteinSequence
>>> protein = GetProteinSequence(ProteinID="Q9NQ39")
>>> result = GetSequenceOrderCouplingNumberp(protein)