big-PI Plant Predictor is a web server utilizing a scoring algorithm for prediction of GPI modification sites in plants.
get_big_pi(data, ...) # S3 method for character get_big_pi(data, ...) # S3 method for data.frame get_big_pi(data, sequence, id, ...) # S3 method for list get_big_pi(data, ...) # S3 method for default get_big_pi( data = NULL, sequence, id, simplify = TRUE, sleep = 1, progress = FALSE, ... ) # S3 method for AAStringSet get_big_pi(data, ...)
data | A data frame with protein amino acid sequences as strings in one column and corresponding id's in another. Alternatively a path to a .fasta file with protein sequences. Alternatively a list with elements of class |
---|---|
... | currently no additional arguments are accepted apart the ones documented bellow. |
sequence | An appropriate column name if a data.frame is supplied to data argument. If .fasta file path, or list with elements of class "SeqFastaAA" provided to data, this should be left blank. |
id | An appropriate column name if a data.frame is supplied to data argument. If .fasta file path, or list with elements of class "SeqFastaAA" provided to data, this should be left blank. |
simplify | A boolean indicating the type of returned object, defaults to TRUE. |
sleep | A numeric indicating the pause in seconds between server calls, at default set to 1. |
progress | Boolean, whether to show the progress bar, at default set to FALSE. |
http://mendel.imp.ac.at/gpi/plant_server.html
If simplify == TRUE:
A data frame with columns:
Character, indicating the protein identifier
Logical, did big-Pi predict the presence of a GPI
Character, indicating the quality of the highest scoring omega-site
Integer, indicating the sequence position of the highest scoring omega-site
Numeric, indicating the p-value for the prediction of the highest scoring omega-site
If simplify == FALSE:
A list of predictions, each element named according to the sequence id, containing a two element list:
data frame, resembling the one returned by simplify == TRUE, along with alternative site predictions (if present)
data frame, with profile dependent and profile independent scores
If the server is unable to make a prediction due to non-amino acid letters or length of the sequence, the returned prediction is NA (is.bigpi column).
Eisenhaber B. Wildpaner M. Schultz CJ. Borner GHH. Dupree P. Eisenhaber F. (2003) Glycosylphosphatidylinositol lipid anchoring of plant proteins. Sensitive prediction from sequence- and genome-wide studies for Arabidopsis and rice. Plant Physiology 133(4): 1691-701
library(ragp) data(at_nsp) #indexes of some sequences in at_nsp ind <- c(129, 145, 147, 160, 170) big_pi_pred <- get_big_pi(sequence = at_nsp$sequence[ind], id = at_nsp$Transcript.id[ind], simplify = FALSE) big_pi_pred <- get_big_pi(data = at_nsp[ind,], sequence = sequence, id = Transcript.id, simplify = TRUE) big_pi_pred#> id is.gpi Quality omega_site PValue #> 1 AT2G20700.1 TRUE P 135 2.672529e-06 #> 2 AT2G13820.1 TRUE P 147 4.831245e-05 #> 3 AT2G48130.1 TRUE P 157 5.483314e-06 #> 4 AT2G03505.1 TRUE P 146 1.938260e-06 #> 5 AT2G45040.1 TRUE S 317 6.007296e-04