Protein Similarity Calculation based on Gene Ontology (GO) Similarity
Source:R/par-02-parGOSim.R
parGOSim.Rd
This function calculates protein similarity based on Gene Ontology (GO) similarity.
Arguments
- golist
A list, each component contains a character vector of GO terms or one Entrez Gene ID.
- type
Input type for
golist
,"go"
for GO Terms,"gene"
for gene ID.- ont
Default is
"MF"
, can be one of"MF"
,"BP"
, or"CC"
subontologies.- organism
Organism name. Default is
"human"
, can be one of"anopheles"
,"arabidopsis"
,"bovine"
,"canine"
,"chicken"
,"chimp"
,"coelicolor"
,"ecolik12"
,"ecsakai"
,"fly"
,"human"
,"malaria"
,"mouse"
,"pig"
,"rat"
,"rhesus"
,"worm"
,"xenopus"
,"yeast"
or"zebrafish"
. Before specifying the organism, please install the corresponding genome wide annotation data package for the selected organism.- measure
Default is
"Resnik"
, can be one of"Resnik"
,"Lin"
,"Rel"
,"Jiang"
or"Wang"
.- combine
Default is
"BMA"
, can be one of"max"
,"average"
,"rcmax"
or"BMA"
for combining semantic similarity scores of multiple GO terms associated with proteins.
Author
Nan Xiao <https://nanx.me>
Examples
if (FALSE) { # \dontrun{
# Be careful when testing this since it involves GO similarity computation
# and might produce unpredictable results in some environments
library("GOSemSim")
library("org.Hs.eg.db")
# By GO Terms
# AP4B1
go1 <- c(
"GO:0005215", "GO:0005488", "GO:0005515",
"GO:0005625", "GO:0005802", "GO:0005905"
)
# BCAS2
go2 <- c(
"GO:0005515", "GO:0005634", "GO:0005681",
"GO:0008380", "GO:0031202"
)
# PDE4DIP
go3 <- c(
"GO:0003735", "GO:0005622", "GO:0005840",
"GO:0006412"
)
golist <- list(go1, go2, go3)
parGOSim(golist, type = "go", ont = "CC", measure = "Wang")
# By Entrez gene id
genelist <- list(c("150", "151", "152", "1814", "1815", "1816"))
parGOSim(genelist, type = "gene", ont = "BP", measure = "Wang")
} # }