Protein Similarity Calculation based on Gene Ontology (GO) Similarity
Source:R/par-02-parGOSim.R
twoGOSim.Rd
This function calculates the Gene Ontology (GO) similarity between two groups of GO terms or two Entrez gene IDs.
Arguments
- id1
Character vector. When length > 1: each element is a GO term; when length = 1: the Entrez Gene ID.
- id2
Character vector. When length > 1: each element is a GO term; when length = 1: the Entrez Gene ID.
- type
Input type of id1 and id2,
'go'
for GO Terms,"gene"
for gene ID.- ont
Default is
"MF"
, can be one of"MF"
,"BP"
, or"CC"
subontologies.- organism
Organism name. Default is
"human"
, can be one of"anopheles"
,"arabidopsis"
,"bovine"
,"canine"
,"chicken"
,"chimp"
,"coelicolor"
,"ecolik12"
,"ecsakai"
,"fly"
,"human"
,"malaria"
,"mouse"
,"pig"
,"rat"
,"rhesus"
,"worm"
,"xenopus"
,"yeast"
or"zebrafish"
. Before specifying the organism, please install the corresponding genome wide annotation data package for the selected organism.- measure
Default is
"Resnik"
, can be one of"Resnik"
,"Lin"
,"Rel"
,"Jiang"
or"Wang"
.- combine
Default is
"BMA"
, can be one of"max"
,"average"
,"rcmax"
or"BMA"
for combining semantic similarity scores of multiple GO terms associated with proteins.
Author
Nan Xiao <https://nanx.me>
Examples
if (FALSE) { # \dontrun{
# Be careful when testing this since it involves GO similarity computation
# and might produce unpredictable results in some environments
library("GOSemSim")
library("org.Hs.eg.db")
# By GO terms
go1 <- c("GO:0004022", "GO:0004024", "GO:0004023")
go2 <- c("GO:0009055", "GO:0020037")
twoGOSim(go1, go2, type = "go", ont = "MF", measure = "Wang")
# By Entrez gene id
gene1 <- "1956" # EGFR
gene2 <- "2261" # FGFR3
twoGOSim(gene1, gene2, type = "gene", ont = "BP", measure = "Lin")
} # }