pKALM: Accurate and Rapid Protein pKa Prediction: Protein Language Models Reveal the Sequence-pKa Relationship

pKALM is a web server for predicting protein pKa values from amino acid sequences, based on protein language model and transfer learning. It achives high accuracy and speed compared to existing methods. It supports the pKa prediction of eight ionizable groups: N-terminal, C-terminal, and six ionizable side chains of Asp, Glu, His, Lys, Cys, and Tyr, with 0.8321 RMSE and a speed of 4,961 pKa / sec. [GitHub | Paper | License]


The prediction is quite fast and the results is a CSV-formatted table with 5 columns:

  1. The first column is the sequence ID.
  2. The second column is the index of the ionizable group.
  3. The third column is the name of the ionizable group.
  4. The fourth column is the predicted pKa shift values.
  5. The fifth column is the predicted pKa values.

Significantly shifted pKa values are highlighted in yellow. The predicted pI are also attached at the end of the table.

Prediction Results

References

Users are kindly requested to utilize the following citation when referencing this method:

Please contact shijie.xu@ees.hokudai.ac.jp for any questions.

Changelogs