Phobius web server is a combined transmembrane topology and signal peptide (N-sp) predictor. Currently only "normal prediction" of signal peptides is supported by the function.

get_phobius(data, ...)

# S3 method for character
get_phobius(data, progress = FALSE, ...)

# S3 method for data.frame
get_phobius(data, sequence, id, ...)

# S3 method for list
get_phobius(data, ...)

# S3 method for default
get_phobius(data = NULL, sequence, id, ...)

# S3 method for AAStringSet
get_phobius(data, ...)

Arguments

data

A data frame with protein amino acid sequences as strings in one column and corresponding id's in another. Alternatively a path to a .fasta file with protein sequences. Alternatively a list with elements of class SeqFastaAA resulting from read.fasta call. Alternatively an AAStringSet object. Should be left blank if vectors are provided to sequence and id arguments.

...

currently no additional arguments are accepted apart the ones documented bellow.

progress

Boolean, whether to show the progress bar, at default set to FALSE.

sequence

A vector of strings representing protein amino acid sequences, or the appropriate column name if a data.frame is supplied to data argument. If .fasta file path, or list with elements of class "SeqFastaAA" provided to data, this should be left blank.

id

A vector of strings representing protein identifiers, or the appropriate column name if a data.frame is supplied to data argument. If .fasta file path, or list with elements of class "SeqFastaAA" provided to data, this should be left blank.

Source

http://phobius.sbc.su.se/

Value

A data frame with columns:

id

Character, name of the submitted sequence.

tm

Integer, the number of predicted transmembrane segments.

sp

Character, Y/0 indicator if a signal peptide was predicted or not.

prediction

Character string, predicted topology of the protein.

cut_site

Integer, first amino acid after removal of the signal peptide

is.phobius

Logical, did Phobius predict the presence of a signal peptide

sp.length

Integer, length of the predicted signal peptide.

Details

The topology (prediction column of the result) is given as the position of the transmembrane helices separated by 'i' if the loop is on the cytoplasmic or 'o' if it is on the non-cytoplasmic side. A signal peptide is given by the position of its h-region separated by a n and a c, and the position of the last amino acid in the signal peptide and the first of the mature protein separated by a /.

Note

This function creates temporary files in the working directory.

References

Kall O. Krogh A. Sonnhammer E. L. L. (2004) A Combined Transmembrane Topology and Signal Peptide Prediction Method. Journal of Molecular Biology 338(5): 1027-1036

See also

Examples

library(ragp) data(at_nsp) phobius_pred <- get_phobius(at_nsp[1:20,], sequence, Transcript.id) phobius_pred
#> id tm sp prediction cut_site is.phobius #> 1 ATCG00660.1 0 Y n17-24c29/30o 30 TRUE #> 2 AT2G43600.1 0 Y n6-17c22/23o 23 TRUE #> 3 AT2G28410.1 0 Y n6-17c22/23o 23 TRUE #> 4 AT2G22960.1 0 Y n8-16c20/21o 21 TRUE #> 5 AT2G19580.1 3 Y n7-18c26/27o42-64i71-95o233-253i 27 TRUE #> 6 AT2G19690.2 0 Y n10-23c28/29o 29 TRUE #> 7 AT2G19690.1 0 Y n10-23c28/29o 29 TRUE #> 8 AT2G33130.1 0 Y n7-20c26/27o 27 TRUE #> 9 AT2G05520.1 0 Y n5-18c23/24o 24 TRUE #> 10 AT2G05520.2 0 Y n5-18c23/24o 24 TRUE #> 11 AT2G05520.3 0 Y n5-18c23/24o 24 TRUE #> 12 AT2G05520.4 0 Y n5-18c23/24o 24 TRUE #> 13 AT2G05520.6 0 Y n5-18c23/24o 24 TRUE #> 14 AT2G05520.5 0 Y n5-18c23/24o 24 TRUE #> 15 AT2G04031.1 0 Y n8-16c24/25o 25 TRUE #> 16 AT2G43620.1 0 Y n11-21c28/29o 29 TRUE #> 17 AT2G37390.1 0 Y n6-16c24/25o 25 TRUE #> 18 AT2G37390.2 0 Y n6-16c24/25o 25 TRUE #> 19 AT2G35250.1 0 Y n5-16c21/22o 22 TRUE #> 20 AT2G38390.1 0 Y n7-22c29/30o 30 TRUE #> sp.length #> 1 29 #> 2 22 #> 3 22 #> 4 20 #> 5 26 #> 6 28 #> 7 28 #> 8 26 #> 9 23 #> 10 23 #> 11 23 #> 12 23 #> 13 23 #> 14 23 #> 15 24 #> 16 28 #> 17 24 #> 18 24 #> 19 21 #> 20 29